Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis

Authors

  • Ling-Hui Chen
  • Yoshihiko Nankaku
  • Heiga Zen
  • Keiichi Tokuda
  • Zhen-Hua Ling
  • Li-Rong Dai
Abstract

In standard approaches to hidden Markov model (HMM)-based speech synthesis, the window coefficients used to calculate dynamic features are pre-determined and fixed. Fixed windows may not be optimal for capturing the various context-dependent dynamic characteristics of speech signals. This paper proposes a data-driven technique for estimating the window coefficients: they are optimized to maximize the likelihood of trajectory HMMs given the data. Experimental results show that the proposed technique achieves naturalness of synthesized speech comparable to that of mean- and variance-updated trajectory HMMs, at significantly lower computational cost.
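For readers unfamiliar with dynamic features, the following minimal sketch shows how fixed windows turn a static feature sequence into delta and delta-delta sequences. It is an illustration only, not the paper's method: the coefficient values (-0.5, 0, 0.5) and (1, -2, 1) are commonly used defaults assumed here, the abstract does not say which fixed windows the authors compare against, and the function name `dynamic_features` and the edge handling (repeating the first and last frame) are hypothetical choices.

```python
def dynamic_features(static, window):
    """Apply an odd-length window to a 1-D static feature sequence.

    Frames beyond either end are treated as copies of the edge frame
    (an assumption made for this sketch).
    """
    half = len(window) // 2
    last = len(static) - 1
    out = []
    for t in range(len(static)):
        acc = 0.0
        for k, w in enumerate(window):
            # Clamp the frame index so the window never reads out of range.
            idx = min(max(t + k - half, 0), last)
            acc += w * static[idx]
        out.append(acc)
    return out


# Assumed, commonly used fixed windows (not taken from the paper).
DELTA_WINDOW = [-0.5, 0.0, 0.5]
DELTA_DELTA_WINDOW = [1.0, -2.0, 1.0]

# Toy static trajectory: one feature dimension over five frames.
static = [0.0, 1.0, 4.0, 9.0, 16.0]
delta = dynamic_features(static, DELTA_WINDOW)
delta2 = dynamic_features(static, DELTA_DELTA_WINDOW)
```

In the approach the abstract describes, coefficients such as those in `DELTA_WINDOW` would be estimated from data rather than fixed in advance; this sketch does not attempt that estimation.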

Similar resources

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a maximum a posteriori (MAP) estimator based on a Laplace-Gaussian combination (for clean speech and noise, respectively) in the HMM ...

HMM based Automatic Speech Recognition Analysis

The main aim of this project, 'HMM Based Automatic Speech Recognition Analysis', is to build a clear and accurate automatic speech recognition system that uses a hidden Markov model (HMM) to obtain accurate results across a number of frequency ranges related to the human voice. The data consist of recordings of 12 different words spoken by a number of different speakers, both male and female ...

Improved Linear Predictive Coding Method for Speech Recognition

In this paper, improved Linear Predictive Coding (LPC) coefficients of the frame are employed in the feature extraction method. In the proposed speech recognition system, the static LPC coefficients + dynamic LPC coefficients of the frame were employed as a basic feature. The framework of Linear Discriminant Analysis (LDA) is used to derive an efficient and reduced-dimension speech parametric s...

An HMM-Based Approach to Flexible Speech Synthesis

The increasing availability of large speech databases makes it possible to construct speech synthesis systems, referred to as corpus-based, data-driven, speaker-driven, or trainable approaches, by applying statistical learning algorithms. These systems, which can be automatically trained, not only generate natural and high quality synthetic speech but also can reproduce voice characteris...

An experimental HMM-based postal OCR system

It is almost universally accepted in speech recognition that phone- or word-level segmentation prior to recognition is neither feasible nor desirable, and in the dynamic (pen-based) handwriting recognition domain the success of segmentation-free techniques points to the same conclusion. But in image-based handwriting recognition, this conclusion is far from being firmly established, and the resul...

Publication date: 2011